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Abstract—This paper presents a new adaptive control ap- 
proach that involves a performance optimization objective. 
The control synthesis involves the design of a performance 
optimizing adaptive controller from a subset of control 
inputs. The resulting effect of the performance optimizing 
adaptive controller is to modify the initial reference model 
into a time-varying reference model which satisfies the 
performance optimization requirement obtained from an 
optimal control problem. The time-varying reference model 
modification is accomplished by the real-time solutions of 
the time-varying Riccati and Sylvester equations coupled 
with the least-squares parameter estimation of the sen- 
sitivities of the performance metric. The effectiveness of 
the proposed method is demonstrated by an application of 
maneuver load alleviation control for a flexible aircraft. 


I. INTRODUCTION 


In this work, we develop a _ multi-objective 
performance-based adaptive optimal control with the 
goal of providing adaptation while seeking to optimize 
a performance metric. The notion of multi-objective 
optimization in the context of model reference adaptive 
control is considered to be novel. As such, research 
in this new area is quite limited. In the recent years, 
Nguyen investigates multi-objective adaptive control 
[1], [2]. In this work, bi-objective optimal control 
modification seeks to minimize both the tracking error 
and the predictor error which are used in the adaptation 
for systems with input uncertainty [1]. A further 
extension led to the development of multi-objective 
optimal control modification which seeks to minimize 
the effect of unmatched uncertainty [2]. Multi-objective 
optimal control has been developed by Nguyen to 
address drag minimization, flutter suppression, and 
maneuver and gust load alleviation for a flexible wing 
aircraft by taking advantage of the redundancy in 
multi-functional flight control surfaces [3]. With a 
similar objective, Wise et al. develop a direct adaptive 
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reconfigurable flight control for a tailless aircraft [4] 
with maneuver load alleviation addressed in a control 
allocation scheme. 

In the current work, we propose a new model- 
reference adaptive control that incorporates optimization 
of a performance metric of an uncertain plant. The plant 
is assumed to have control redundancy to provide multi- 
objective adaptive control tasks: 1) model-reference 
adaptive control for managing system uncertainties and 
2) performance optimization. The performance metric is 
available from sensor measurements but its sensitivities 
with respect to the state and control variables are un- 
known but can be estimated by a least-squares parameter 
estimation method. The performance optimization is cast 
as a multi-objective optimization problem. This results in 
time-varying modified Riccati and Sylvester equations. 
The performance optimizing adaptive controller obtained 
from the solutions of the time-varying Riccati equation 
and Sylvester equation results in time-varying control 
gains. The closed-loop plant thus is time-varying. As 
a result, the linear time-invariant reference model must 
be modified accordingly to be time-varying so that 
asymptotic tracking can be achieved. The time-varying 
reference model modification is another novelty of the 
current work. Reference model modification in model- 
reference adaptive control has been investigated exten- 
sively by many researchers but time-varying modification 
is generally not considered in these investigations. Stepa- 
nyan and Lavretsky independently develop a closed-loop 
reference model modification that contains a tracking 
error feedback term in the reference model which is 
linear time-variant [5], [6]. Gibson investigates a closed- 
loop reference model modification originally developed 
for the observer-based output feedback adaptive control 
proposed by Lavretsky [7] for output feedback systems. 
The closed-loop reference model includes an output 
feedback term, but overall is a linear time-invariant 
reference model [8]. 


II. PERFORMANCE OPTIMIZING ADAPTIVE CONTROL 
A. Problem Formulation 


Consider a plant model 
F=Ar+B [ut OT @()| (1) 


subject to x(0) = xo, where x(t) € R” is the state 
vector, u(t) € R” is the control vector, @* € R!*” is 


an unknown constant matrix that represents a matched 
uncertainty, (x) € R! is a known regressor function, 
and A € R”’*” and B € R”*” are known matrices. The 
plant is considered to be a full-state system with all the 
available states for control. Therefore, the pair (A,B) is 
assumed to be controllable. The plant is associated with 
a performance metric 


y=Cx+Du (2) 


where y(t) € IR‘, where k <n, is a vector of performance 
metrics for the plant which is available from measure- 
ments, and C € R**” and D € R**” are sensitivity 
matrices which are assumed to be unknown unknown. 
The initial design of an adaptive controller without the 
performance optimization is to establish a linear time- 
invariant reference model 


Xm = AmXn + Br (3) 


based on the closed-loop plant without the matched 
uncertainty using a nominal controller Unjom (t) given by 


Unom = Kyx + K,r (4) 


where A,, =A+BK, € R”*” is a Hurwitz state transition 
matrix of the reference model, B,, = BK, € R’”? is a 
command transition vector of the reference model, and 
r(t) € R@ is a bounded reference command signal with 
a bounded derivative, i.e., r(t) € Z and 7 (t) € Z and 
the assumption that g <m . The reference command 
signal is generated by the following implementation 


Z=A,-Z4+ 2% (5) 
r=C,z (6) 


subject to z(0) = zo, where z(t) ER’, p>q, z-E R? isa 
constant command signal, A; € R?*? is a stable matrix, 
and C, € R?*?. Without the performance optimization, 
an adaptive controller can be designed straightforwardly 
with 

Uu = Unom + Uad (7) 


where ugq (t) is an augmented adaptive controller given 
by wad = —@' (x) with the adaptive law 


© =—I'@(x)e'PB (8) 


where e(t) = x(t) —x(t) is the tracking error and P 
solves the Lyapunov equation 


PAn +A,,P = —O (9) 


with Q = Q' > 0 a positive-definite weighting matrix. 
To address the performance optimization, we consider 
the following adaptive controller 


(10) 


U = Unom + Up + Uad 


where u, is the performance optimizing augmentation 
controller. The plant with the adaptive controller now 
becomes 


%=Amx+Bmr + Buy + Bugg +BO*'® (x) (11) 


We re-define y(t) as an incremental performance 
metric due to the performance optimizing augmentation 
controller up (t) with 


y=Cx+Dup (12) 
Let $(t) be the estimate of y(t) where 
$=Cx4+Duy (13) 


with C(t) and D(t) being the estimates of the sensitivity 
matrices C and D, respectively. Next, the performance 
metric estimation error is computed as 


ey =$S-y=Cx+Duy (14) 
where C(t) = C(t) —C and D(t) = D(t)—D are the 
estimation errors of the sensitivity matrices C and D, re- 
spectively. C(t) and D(t) can be estimated from a least- 
squares gradient method that minimizes the following 
cost function 


(5) 


This results in the following least-squares gradient 
adaptive laws for estimating C(t) and D(t): 


ae aL 
Cl =-Ic ae —T exe; (16) 
F aJ 
Di = Tp55 = Tpupey (17) 


where Tc ='6 € R"™" > 0 and Ip = E R™" > 0 
are positive definite adaptation rate matrices. To design 
a performance optimizing control, we consider the fol- 
lowing ideal plant with u*, (t) = —O*'®@(x) to achieve 
perfect tracking: 


X= Anx+Bnr+Bup (18) 


We now cast this problem as a multi-objective opti- 
mization problem by using the following infinite-time- 
horizon cost function [3]: 

‘f 

(9° O9-+u} Rup) dt 


J mn (19) 


t fe 0 
where 0=Q' €R*®™*>0 and R=R!' €R”” > 0 are 
positive definite weighting matrices, subject to the plant 
dynamics in Eq. (18). We formulate an optimal control 
problem by establishing the following Hamiltonian func- 
tion: 


iran 
H= 53 OF + 5 up Rup +m" (Amx+Bmr+Bupy) (20) 


where p(t) € R” is an adjoint vector which acts to 
enforce the dynamic constraint imposed by the plant. 
Then, the necessary conditions of optimality are ob- 
tained as 
dH! 


w= aa —C€'Q(Cx+Duy) —Alu 


subject to the transversality condition wu (tr) = 0, and 


oH! 


(21) 


a D'Q(Cx+Duy)+Rup+B'p=0 (22) 
Up 
The optimal control u, (t) is then obtained as 
R a\—! a x 
as (r+ Bob) (B'u +B" Q¢x) (23) 


We proceed with the assumed solution of p (t) having 
the form 

M=Wx+Vz+Uz, (24) 

Upon substitution, we obtain the following equations: 

W+WA+A'W—WBR 'B'W+0=0 (25) 

V+ (a7 zd WBR'B") V+VA,+WB,C,=0 (26) 


U + (AT —waR'B") U+V=0 (27) 


subject to the transversality conditions W (tr) = 0, 
V (tr) = 0, and U (tr) = 0, where 


A=A, —BR'B' oC (28) 
0=-CoQ (1- BR 'D"0) (6 (29) 
R=R+D'QD (30) 

Q is chosen such that 0 > 0 which implies 
DR'D'O<1 (31) 


Equation (25) is recognized as the time-varying differen- 
tial Riccati equation with the time-varying matrix A (t) 
and time-varying positive definite weighting matrices 
Q(t) and R(t) which are updated at each time step 
as C(t) and D(t) are computed from the least-squares 
gradient adaptive laws. The existence of the solution of 
a time-varying differential Riccati equation depends on 
the properties of the time-varying matrices A(t), Q(t), 
and R(t) according to the following theorem: 
Theorem 1: Let A(t) = A* +6; (t), O(t) = OF + 
5g (t) > 0, and R(t) = R* + dg(t) > O where A*, Q*, 
and R* are some constant matrices. If 6; (¢), 59 (t), 
and 6g (t) are piecewise continuous for all ¢ € [0,c); 
lim; sco Oj (t) = 0, limy—co 56 (t) =0, and lim,-,.. dg (t) = 
0; and furthermore, 6; (t), do (t), and da (t) are uni- 
formly continuous; then A(t), Q(t), and R(t) tend to 
their constant solutions A*, O*, and R*, respectively, as 
t —> co, Consequently, the solution of the time-varying 


differential Riccati equation exists and also tends to its 
constant solution in the limit as ty + ©. 

Proof: If 6; (t), 6g(¢), and 6g (t) satisfy the con- 
ditions in Theorem 1, then according to the Bar- 
balat’s lemma lim;_,.. 55 (t) =0, lim; do (t) =0, and 
lim,s.0 dg (t) = 0. A function f(t) that satisfies the 
conditions lim,..f (t) =0 and lim,,.. f(t) = 0 exhibit 
convergence properties such as an exponential decay 


function. Therefore, A (t) + A*, Q(t) > O*, and R(t) = 
R* as t 3 &. 

Let W (t) = W* + dy (t) be the solution of the time- 
varying Riccati equation. Then, 


. * q* 7* T * 
dw + (W* + Sw) (A* + 53) + (A* +63) (W* + dy) 
— (W* + dy) B(R* +5) |B" (W* + by) 
+(0* +65) =0 (32) 
Separating terms and taking the limit as t — oo yield 
W*A* +A*' w* —wW*BR*|'B'w*+0*=0 (33) 


dy + dwA* +A"! Sw — SwBR*'B' dy 
— 6yBR*-'B' w* —W*BR*'B' Sy =0 (34) 


Now transforming Eq. (34) into the time-to-go vari- 
able t =ty —t gives 


— 280 Sy" +A" By — by BR BT by 


dT 
— by BR*-'B' w* —W*BR*'B' Sy =0 (35) 


subject to Sw (t =0) =0 since W (tr) =0. The solution 
of Eq. (34) yields d6w =0 and dy =0 as tp > ~. Thus, 
W (t) > W* as tp + © which implies W (r) + 0. 


Equation (26) is recognized as the time-varying differ- 
ential Sylvester equation. The existence of the solution 
of Eq. (26) can be established from Theorem 1. If 
W (t) + W*, then the closed-loop state transition matrix 
A(t) —BR-!(t)B'W/(t) tends to a constant Hurwitz 
matrix A, = A* — BR*—'B'W*. Equations (26) is trans- 
formed into the time-to-go variable as 


dv* 
dt 


subject to V*(t =0) =0. Since A; is a stable matrix, 
and A, can be chosen such that A; (A;) +A; (A,r) £0 for 
all i and j, then it follows that V*(t) is bounded as 
ty — co and tends to a constant solution of the following 
algebraic Sylvester equation: 


=A] V*+V*A,+WBmC; 


(36) 


ALV*+V*A,+W*BnC, =0 (37) 


Equation (27) is transformed into the time-to-go vari- 


able as 
dU* 


dt 
subject to U*(t =0) =0. Since V*(t) exists, U* (t) is 
bounded as t¢ — and tends to the following solution: 
Uu* =—A,'Vv* (39) 


Thus, the solutions of Eqs. (25), (26), and (27) are 
computed from their corresponding time-varying alge- 
braic equations. The performance optimizing controller 
Up is then expressed as 


=A! U*+Vv* (38) 


Up = Ky (t)x+K; (t)z+Kz,. (t) zp (40) 

where 
R= Ro (B'w +B" 9C) (41) 
K,=—R'B'v (42) 
K,, =R'B'AD'V (43) 


Note that the uniform continuity condition for 6; (t), 
do (t), and 8g (t) requires the parameter convergence of 
the estimates C(t) and D(t). This convergence can be 
stated in the following theorem: 

Theorem 2: If the reference command signal r (ft) is 
persistently exciting, then C(t) and D(t) converge expo- 
nentially to their true values and the uniform continuity 
condition for 6; (t), do (t), and g(t) is thus satisfied. 

Proof: Choose a Lyapunov candidate function 


V(6,B) =wace(Cre'7+15'B") (44) 
Then, V (C ,D) is evaluated as 
Vv (C,D) — —2e} Cx - 2e, Dup 

=~2(Cx+Bup)' (Cx+Dup) =—2eJey <0 (45) 


Therefore, C(t) € ANY, and D(t) € ANG, but 
x(t) € Lo and up (t) € Lo. It follow that Ky (t) € Za, 
K,(t) € Lo, and K,, (t) € Yo. Therefore, for the ideal 
plant, x(t) € Lo. 

We note that V (C,D) is bounded since x(t) € LZ, and 
lip (t) € Zo by the virtue of z(t) € LZ, Kx(t) € Z, 
K.(t) € Za, and K,, (t) € LZ. Therefore, V(C,D) is 
uniformly continuous. Invoking the Barbalat’s lemma, 
V (C,D) — 0 as t +. This implies ey (t) + 0 as t > ©. 
However, this does not necessarily imply that C(t) and 
D(t) tend to zero since x(t) and up (¢) can be zero signals 
after some finite time interval. 

Since x(t)and u,(t) depend continuously on r(f), 
then the persistent excitation condition is satisfied if the 
reference command signal r(t) is persistently exciting: 


+f | x 
T J; Up 


lie ul Jac>al (46) 


for all t >) and some @>0.LetA'=[C Dj, T= 
diag ([.,I'p), and Y (x,upy) = [ x! uh ile Then, 


V(C,D) = —2! (x, up) AAP" (x, up) 


2V (C,D) [SY (x, up) |” 
a Amax (Fy) 


(47) 


Since || (x,up)||77 = Sou) VGa tS 
WY (x,upy)P' (x,up), it follows that 
ant 2aV (C,D) 
V(C,D) <—- (48) 
( ) Avion (Ex) 


This implies 


Z Vo at 
Cc 49 
MieVma men) © 
2 Vo at 
D 50 
[Pls matty? ( mits) oe 


where Vo = V(C(0),D(0)). Therefore, C(t) and D(t) 
converge exponentially to zero as t —+ 00. Note that C(t) 
and D(t) satisfy the uniform continuity condition. Then, 
it follows that 5, (r), dg (¢), and dp (t) are also uniformly 
continuous. 


It follows from Theorem | that C(t) > C and D > 
D as t+. This implies that A(t), O(t), R(t), Ky (t), 
K,(t), and K,, all converge to their ideal values A*, Q*, 
R*, K*, K?, and K*, respectively. 

The closed-loop plant with the performance optimiz- 
ing controller u, (ft) now becomes 


X= (Am + BKy)x + Bmr + K+ Kz,2+ + Buga 
+B@*'®(x) (51) 


If the parameter convergence is achieved, then the 
ideal performance optimizing reference model as 


Xm = AmXm + Bmr + BK7z + BK; z, (52) 


where A*, = A, + BK*. Let &»(t) be the estimate of 
x*,(t). Then, the time-varying performance optimizing 
reference model is given by 


Kin = (Am + BKx) £m + Bmr + BK,z+ BK;z, Zr (53) 


Thus, £ (t) + x%, (t) as t > 09. Let e (t) =f (t) — x(t) 
be re-defined as the tracking error based on the time- 
varying modified reference model. Then, the tracking 
error equation becomes 


é = (Am + ByKy)e+BO' &(x) (54) 


The MRAC update law for O(t) is based on the 
optimal control modification [9] and given by 


@ = Te (x) le'w —vo! (x) BW (Am ar B 
55) 
where v > 0 is a modification parameter for use in the 


design trade-off. By setting v = 0, we recover the new 
MRAC update law 


© = -I'@(x)e'WB (56) 


where the time-varying matrix W(t) replaces the con- 
stant matrix P in the standard MRAC update law in Eq. 
(8). 

The stability of the optimal control modification adap- 
tive law can be shown in the following proof: 

Proof: Choose a Lyapunov candidate function 


V (c,0) =eTWe+trace(OT™16) (57) 


Then, V (e,) is evaluated as 
V (e,0) = 
—e! (w +WAm+WBR,+A,W +R) BW) e 


—2v! (x)@B'W (Am +BK,) 'BO'®(x) (58) 


From Theorem 1, W (t) + 0 as ty > 00. Then, substi- 
tuting Eq. (41) into Eq. (58), we get 


V (e,6) =—e" (O+wsk'B'w) . 
—2v@! (x) OB'W (Am +BK,) | BO'® (x) < —c) |lel| 
— ver ||® (x)||? (||O|] —¢3)” + vere (x)? 59) 


where cy = inf; Amin (O* +W*BR*"'B'W*) > 0, c2 = 
inf; Amin (B'A*,— | (O* +W*BR*—'B'W*) A*—'B) > 0, 
and cj = SelB Wan allie’, 9. Then, [J (x)I] < bo 
for some 0 < V < Vinay. Thus, V (e,@) < 0 outside 
a compact set. Therefore, the closed-loop system is 
uniformly ultimately bounded. 

If v =O, then it can be shown that V (e, 0) is bounded. 
Then, according to the Barbalat’s lemma, V (e,0) >0 
which implies e(t) > 0 as t + oc, Thus, asymptotic 
tracking is achieved with the time-varying reference 
model modification by the performance optimizing con- 
troller for the new MRAC update law with v = 0. 


III. APPLICATION 


We implement a maneuver load alleviation adaptive 
control design for a flexible wing Generic Transport 
Model (GTM) equipped with the Variable Camber Con- 
tinuous Trailing Edge Flap (VCCTEF) originally pro- 
posed by NASA [10], as shown in Fig. 1. 


Figure 1. GeTM with Variable Camber Continuous Trailing Edge Flap 


Consider a linearized reduced-order longitudinal 
model of a flexible aircraft with matched uncertainty in 
the rigid aircraft states 


i, =Apx,+Bplty + Ba Ez +0 @(x,)| +A (60) 


where x, (t) € R” is a rigid aircraft state vector, up (t) € 
R”"! is a control vector due to the VCCTEF, uq(t) € R 
is a control due to the elevator, @* € R?*” is a constant 
and unknown matrix, ®(x,) € R? is a vector of known 
regressors, and A represents the effect of unmodeled 
dynamics of the elastic wing modes. The reduced-order 
plant matrix A, is assumed to be Hurwitz. For the 
maneuver load alleviation, the performance metric is 
taken to be the wing root bending moment which is 
measured from a strain gauge sensor. A virtual control 
concept is employed whereby the flap deflection is 
mapped into a mathematically smooth shape function 
whose coefficients are the virtual control variables [3]. 
Let up(t)=[ co(t) ci(t) e2(t) 03 (¢) |" be a vec- 
tor of the unknown coefficients of a cubic Chebyshev 
polynomial. We design a pitch attitude controller for 
the elevator using the reduced-order model to track the 
following second-order pitch attitude reference model: 


6m +26 On Om + 02 Om = O77 (61) 


This reference model is designed to only track a pitch 
attitude command without any knowledge of the wing 
root bending moment to be minimized during a pitch 
maneuver. We initially choose a sinusoidal reference 
command signal r(t) = O)sin@t where 6) = 20° and 
@ = 2 rad/sec to ensure the persistently exciting signal 
quality to demonstrate the parameter convergence. The 
simulation uses a full model with 65 states. The optimal 
control modification adaptive law is implemented with 
v =0.01. Using the linear asymptotic property of the 
optimal control modification [11], the closed-loop plant 
in the limit is shown to be stable with v = 0.01. 
The aircraft response of @(t) tracks the time-varying 


modified reference model very well and the amplitude of 
the performance metric y(t) is reduced by 38.8% due to 
the performance optimizing adaptive controller, as shown 
in Fig. 2. Thus, the effectiveness of the performance 
optimizing adaptive controller is demonstrated. The es- 
timated performance metric $(t) converges to the mea- 
sured performance metric y(t) extremely well. Figure 3 
shows the time histories of the adaptive parameters © (t), 
C(t), and D(t). The elements of the estimated matrices 
C(t) and D(t) converge to their steady state values close 
to the true values. The elements of © (t) do not converge 
but fluctuate near their ideal values. The optimal control 
modification slightly reduces the initial high frequency 
oscillations in the signals. 
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Control with r(t) = 6 sin ot 
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Figure 3. @(r), C(t), and D(t) with r(t) = @ sin or 


IV. CONCLUSIONS 


This paper presents a new method called performance 
optimizing adaptive control. The performance optimiza- 
tion is formulated as a multi-objective optimal control 
problem coupled with a least-squares adaptive law for 
estimating the unknown sensitivities of the performance 
metric needed. This results in time-varying Riccati and 
Sylvester equations whose solutions provide control 
gains for use in a time-varying modified reference model. 
The Lyapunov stability theory shows that the model- 
reference adaptive control with the time-varying mod- 
ified reference model to achieve the performance opti- 
mization is stable and bounded. Simulations of maneuver 
load alleviation control for a flexible aircraft demonstrate 
the effectiveness of the performance optimizing adaptive 
control. 
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